🐿️ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🗣️ Speech Synthesis

Neural TTS, Voice Cloning, Real-time Audio, Kitten TTS

Cloning a Conversational Voice AI Agent from Call\,Recording Datasets for Telesales
arxiv.org·1d
🎚️Voice AI Systems
Show HN: Local, extensible and fast macOS transcription app
github.com·13h·
Discuss: Hacker News
🎙️Whisper
OpenAI Researchers Have Discovered Why Language Models Hallucinate
thealgorithmicbridge.com·9h·
Discuss: Hacker News
🏗️AI Infrastructure
Alterego: Speech Intent to Input
alterego.io·7h·
Discuss: Hacker News, r/ErgoMechKeyboards, r/ErgoMechKeyboards
🎚️Voice AI Systems
HAVE: Head-Adaptive Gating and ValuE Calibration for Hallucination Mitigation in Large Language Models
arxiv.org·1h
🏠Self-hosted AI
I built an open-source, end-to-end Speech-to-Speech translation pipeline with voice preservation (RVC) and lip-syncing (Wav2Lip).
reddit.com·2d·
Discuss: r/artificial
🎚️Voice AI Systems
MeanFlow-Accelerated Multimodal Video-to-Audio Synthesis via One-Step Generation
arxiv.org·1h
🎙️Whisper
Something to critique
onebadbit.com·3h
⏱️productivity
The Rosetta Stone of Roars: Decoding Animal Sounds with Voice AI
dev.to·2d·
Discuss: DEV
🎚️Voice AI Systems
D-HUMOR: Dark Humor Understanding via Multimodal Open-ended Reasoning
arxiv.org·1h
🎯Vector Databases
Introduction to Nyquist and Lisp Programming
manual.audacityteam.org·9h·
Discuss: Hacker News
λFunctional Programming
NeuroBOLT: Resting-state EEG-to-fMRI Synthesis with Multi-dimensional Feature Mapping
arxiv.org·1h
🧠Neuromorphic Hardware
Show HN: Talk to any model with your team
showcase.thytus.com·6h·
Discuss: Hacker News
🏗️AI Infrastructure
TSPC: A Two-Stage Phoneme-Centric Architecture for code-switching Vietnamese-English Speech Recognition
arxiv.org·1h
🎙️Whisper
Serialized Output Prompting for Large Language Model-based Multi-Talker Speech Recognition
arxiv.org·1d
🎙️Whisper
We built an Artificial Brain that sleeps, dreams, and forms memories
github.com·14h·
Discuss: Hacker News
🧠Neuromorphic Hardware
VibeVoice: Turn Text into 90‑Minute Multi‑Speaker Podcasts
vibevoice.cc·4d·
Discuss: Hacker News
🗣️Voice Coding
LibriQuote: A Speech Dataset of Fictional Character Utterances for Expressive Zero-Shot Speech Synthesis
arxiv.org·4d
🎚️Voice AI Systems
Refining Transcripts With TV Subtitles by Prompt-Based Weakly Supervised Training of ASR
arxiv.org·1d
🎙️Whisper
Why Language Models Hallucinate: An In-Depth Look at Model Misalignment and Mitigation Strategies (2025)
dev.to·1d·
Discuss: DEV
🏗️AI Infrastructure
Loading...Loading more...
AboutBlogChangelogRoadmap